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1.  Introduction  and  Summary 

The  failure  rate  function  h  is  an  important  parameter  in  re¬ 
liability  and  biometry.  Estimates  of  h  using  weighting  functions  or 
"kernels"  are  quite  common  in  the  literature  [see  Rice  and  Rosenblatt 
(1975)  and  related  references].  The  kernels  that  have  been  considered 
so  far  are  nonnegative  and  absolutely  integraJyle  in  (-00,00)  .  (Kernels 

satisfying  this  latter  condition  are  known  as  L*  kernels.)  In  Sing¬ 
purwalla  and  Wong  (1980) — abbreviated  as  SW(1980) — we  have  shown  that 
the  mean  square  error  (MSE)  of  a  kernel  estimator  of  h  using  an 

L^  kernel  restricted  to  be  nonnegative  has  a  rate  of  convergence  of  at 
-A/5 

most  0(n  )  ,*  regardless  of  the  smoothness  of  h  ;  where  n  is  the 

sample  size.  If  the  nonnegativity  condition  of  an  L^  kernel  is  re¬ 
laxed,  and  if  h  is  m  times  continuously  differentiable,  then  (for 
m>2),  the  rate  of  convergence  of  the  MSE  (can  be  improved  and)  is  at 

*The  notation  "a  =  0(b  )"  denotes  the  fact  that  la  /b  |  is 
n  n  1  n  n ' 

bounded  in  the  limit. 
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2tq 
2u|  |  ^ 

most  0(n  )  .  A  method  for  producing  kernel  estimators  having  the 

above  property  is  the  generalized  jackknife  of  Gray  and  Schucany  (1972). 
Specifically,  if  we  use  the  generalized  jackknife  on  two  kernel  estima¬ 
tors  of  h  ,  with  each  estimator  being  based  upon  a  nonnegative  ker¬ 

nel,  then  this  is  equivalent  to  directly  producing  a  kernel  estimator  of 

h  using  an  kernel  which  takes  both  positive  and  negative  values. 

If  we  continue  to  apply  the  generalized  jackknife  method,  then  the  rate 
of  convergence  of  the  MSE  of  the  resulting  estimator  can  be  brought  as 

close  to  n  ^  as  is  desired.  This,  plus  the  alternating  behavior  of 
the  kernel  discussed  in  Example  5.3  of  SW(1980),  has  prompted  us  to  con¬ 
jecture  that  an  indefinite  jackknifing  of  estimators  based  on  ker¬ 

nels  is  equivalent  to  obtaining  a  kernel  estimator  using  an  alternating 

(wave-like)  non  kernel. 

Motivated  by  the  above  considerations,  out  goal  in  this  paper  is 
to  obtain  an  estimator  of  h  whose  MSE  converges  to  0  faster  than 
2m 

0(n  2m+1)  for  any  finite  m > 0  ,  and  preferably  is  closer  to  the  ideal 

n  1  .  We  achieve  this  goal  by  considering  a  kernel  estimator  of  h 
based  on  the  "sine"  kernel,  illustrated  in  Figure  3.1.  In  Section  3  we 

show  that  the  sine  kernel,  which  is  not  an  kernel  and  may  not  be  a 

limiting  case  of  jackknifing  an  kernel  either,  arises  naturally 

when  we  estimate  h  via  an  estimate  of  the  Fourier  transform  of  h  . 

The  sine  kernel  estimator  of  h  is  also  referred  to  as  the  "Fourier 
integral  estimate." 

In  Section  4  we  show  that  the  sine  kernel  estimators  of  h  are 
asymptotically  unbiased  and  consistent.  In  Section  5,  we  discuss  the 
rates  of  convergence  of  the  bias  and  the  MSE  of  these  estimators.  We 
show  that  for  certain  classes  of  failure  rate  functions,  the  sine  kernel 
estimators  have  a  faster  rate  of  convergence  of  the  MSE  than  the 
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corresponding  L  kernel  estimators.  These  rates  are  of  the  order 

(log  n/n)  or  (n  "  )  ,  depending  upon  whether  the  Fourier  transform 

of  h  decreases  ’’exponentially"  or  "algebraically  with  degree  p"  (see 
Definitions  5.1  and  5.2).  Clearly,  when  p  >  m+1  ,  both  the  above 

2m 

rates  are  faster  than  n  .  However,  they  are  not  equal  to  the  ideal 

rate  of  n  *  ,  which  we  know  can  be  attained,  if  possible,  by  jackknif¬ 
ing  indefinitely. 

Sine  kernels  have  been  considered  before  in  the  literature  on 
density  estimation.  Davis  (1975)  has  shown  that  under  certain  condi¬ 
tions  density  estimates  based  on  the  sine  kernels  have  a  faster  rate  of 

convergence  of  the  MSB  than  those  based  on  kernels.  Thus,  the  re¬ 

sults  of  our  paper  complement  those  of  Davis.  An  explanation  of  why  the 
sine  kernel  gives  us  faster  rates  of  convergence  of  the  MSE  lies  in  the 
effect  of  jackknifing  on  kernels.  Typically,  the  generalized  jackknife 
method  is  undertaken  to  improve  upon  the  rate  of  convergence  of  the  bias 

and  the  MSE,  and  an  indefinite  jackknifing  using  L1  kernels  leads  us 

to  a  non  L*  kernel,  which  alternates  between  positive  and  negative 
values,  like  the  sine  function. 

2.  Preliminaries:  Kernel  Estimates 

Suppose  that  the  time  to  failure  of  a  device  is  a  nonnegative 
random  variable  X  ,  with  an  absolutely  continuous  distribution  function 
F  and  a  probability  density  function  f  .  The  failure  rate  at  x^  , 

h(Xg)  ,  for  F(xq)  j*  1  ,  is  defined  as 

f(x0) 

h<V  1  -  F(x0)  ’ 

note  that  h(x)  >  0  ,  for  all  x  >  0  . 
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Given  an  ordered  sample  of  n  lifetimes  from  F 


say  X(l)’ 


a  kernel  estimate  of  h(xQ) 


h(n,XQ>  ,  is  defined  as 


h(n,xQ) 


n 


l 

j=i 


i  i 
n-j+l  b(n) 


(2.1) 


where  the  kernel  K  is  a  bounded,  symmetric  function  of  integral  one; 
the  scale  parameter  b(n)  is  a  nonnegative  decreasing  function  of  n 
such  that 


(i)  lim  b(n)  =  0  ,  and 

n->oo 

(2.2) 

(ii)  lim  n  b(n)  =  °°  . 
n-*» 


A  motivation  for  considering  the  kernel  estimates  of  the  failure 
rate  are  given  in  Watson  and  Leadbetter  (1964a).  Some  of  the  commonly 
used  kernels  are  the  rectangular,  the  triangular,  the  Weierstrass,  the 


Piccard,  the  Cauchy,  and  the  kernel  K(x) 


-£( 


sm 


(x/2) X 


Among 


(x/2)  / 

other  things,  these  kernels  have  the  following  two  features  which  are  of 
interest : 


(i)  they  are  nonnegative,  and 

(ii)  they  are  absolutely  integrable  in  (-“>,°°)  ;  kernels 
which  satisfy  this  property  are  called  L1  kernels. 


Watson  and  Leadbetter  (1964b)  have  shown  that  for  a  certain  class 

of  distribution  functions,  estimates  based  on  kernels  are  asymp¬ 

totically  unbiased  and  consistent,  at  every  point  x  at  which  h  is 
continuous  and  F(x)  <  1  .  The  optimal  rates  of  convergence  of  the  bias 
and  the  MSE  of  h(n,x^)  have  been  discussed  by  SW(1980). 


3.  Kernel  Estimates  Based  on  the  Fourier  Integral 

We  shall  confine  our  attention  to  the  class  of  failure  rate  func¬ 
tions  h  for  which  the  Fourier  transform  exists;  that  is 
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4>  (x)  =  /elxu  h(u)du  <  “  • 
n 


Let  Xq  be  a  point  of  continuity  of  h(x)  ,  and  assuming  that 

4>  e  (i.e.,  /|$>  (x)  Idx  <  °°)  ,  the  following  inversion  formula  gives 

h  h 

us  the  basis  for  considering  the  Fourier  integral  estimate  of  the 
failure  rate: 


h(x0) 


-ix0u 


(u)du  . 
h 


(3.1) 


Let  F^  be  the  modified  sample  distribution  funetion ;  that  is, 

the  usual  sample  distribution  function  multiplied  by  n/(n+l)  .  An  es¬ 
timate  of  h(x)  at  x  =  X..,  ,  h  (x)  ,  is 

(/)•>  n 


hn(x) 


f  (x)  dF  (x)  . 

n _  _  n _  _  1 

1  -  F  (x)  “  1  -  F  (x)  _  n-j+1  * 

n  n 


Let  <J>  be  the  Fourier  transform  of  h  ;  that  is, 
h  n 


n  ixX,.. 

$,  (x)  =  felxu  h  (u)du  =  l  ~~  e  .  (3.2) 

h  n  '  .  .  n-j+3 

n  J=1 


To  obtain  from  (3.1)  an  estimate  of  h(x^)  ,  we  replace  by 

$  ,  and  to  assure  finiteness  of  the  integral,  we.  take  it  between  the 

h 

n 

finite  limits  (-  >  bTn)^  ’  w*iere  t'ie  b(n)  satisfy  (2.2),  we  ob¬ 
tain  the  Fourier  integral  estimator  of  h(x^)  ,  h(n,x0)  ,  where 


h(n,xQ) 


(3.3) 


Replacing  by  its  expression,  (3.2),  we  have 

n 
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Thus  we  see  that  the  Fourier  integral  estimate  of  the  failure 
rate  is  indeed  a  kernel  estimate,  with  the  sine  function  S  as  the 
kernel. 


Note  that  the  kernel  S  is  not  an  kernel,  but  that  it  is 

2  1 

symmetric,  bounded,  and  of  integral  one;  also  / S  (x)dx  =  —  . 

4.  Asymptotic  Unbiasedness  and  Consistency 

Since  S  is  not  an  L"*-  kernel,  the  asymptotic  unbiasedness  and 

consistency  of  h(n,Xg)  has  to  be  established  first.  Once  this  is  done, 

we  will  be  able  to  discuss  the  rates  of  convergence  of  the  bias  and  the 

MSE. 

Theorem  4.1:  Let  <  X,„v  <  ...  <  X,  v  be  an  ordered  sample  of 

- (1)  -■=  (2)  -=  (n)  * 

lifetimes  from  an  absolutely  continuous  distribution  function  F  . 

Suppose  that: 

(i)  the  failure  rate  function  h  is  absolutely  inte- 
grable*; 

(ii)  h  satisfies  Dirichlet's  conditions  in  any  finite 
interval;  that  is,  h  has  at  most  a  finite  number 
of  finite  discontinuities,  and  no  infinite  discon¬ 
tinuities  in  any  finite  interval,  and,  furthermore, 
h  has  only  a  finite  number  of  maxima  and  minima 
in  any  finite  interval; 

(iii)  h(x)  is  continuous  at  x^  ; 

(iv)  F(xQ)  <  1  ;  and 


*When  h  is  absolutely  integrable,  F  is  a  subdistribul ion 
function  [Chung  (1974,  p.  84)]. 
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=  f 


jx  (A)  o-(“»3'1«-'i<u»n-i+1h(u)d„ 


(u)  ]du 


.  1  ,  Sin(Sf)  -  1  , 


f  - h(u)du  _±  f  - h(u)F“(u)du  . 

TT  U  “  Xq  IT  11  -  Xq 


(4.2) 


Consider  the  limit  of  the  first  term  on  the  right-hand  side  of 


(4.2): 


1  sin(b0?) 

lim  —  J  — — — - — 


n-x» 


u-  x„ 


h(u)du  = 


li.  i  /  — ^  h(»)du 


b(n)-K) 


.  sin[A(u-xn)] 

=  lim  —  / - — - —  h(u)du 

.  ir  u  -  x,. 

X-*"  0 


=  h(xQ)  . 


(4.3) 


The  last  equality  follows  by  the  Fourier  integral  formula  [see  Titch- 
march  (1962,  pp.  3,  25)  or  Smirnov  (1964,  pp.  462-472)]. 


Next  we  show  that  the  second  term  on  the  right-hand  side  of  (4.2) 
tends  to  zero  geometrically,  as  n-*00  .  Since  F(x^)  <  1  ,  we  can  choose 

a  X  >  0  so  that  F(x^+X)  <  1  ,  and  such  that  h(u)  is  bounded  in 

(t.-Xgl^X  .  We  split  the  interval  of  integration  (-00,00)  into  two 

parts,  |u-Xq|<X  and  |u-Xq|  >X  ,  and  note  that 


1  *ln(bTnf) 


--  h(u)Fn(u)du  <  (const)F°(xn+X)  ->  0  ,  (4.4) 

i  i  . tt  u  -  x_  --  u 

|u-x0|<X  0 


as  n  ^ 00  ,  and 
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a 

n 


» 


and  take  the  limit  as  n-»°°  ,  we  note  that  the  first  term  on  the  right- 
hand  side  of  (4.6)  equals  bCx^) / (l-F(x^))  whereas  the  second  term  is 
0  .  Thus 


lim  ~  Var[h(n,xQ) ] 
n-*~  n 


h(xQ) 

1  -  F(x0) 


or  that 


Var [h(n,xQ> ] 


an  h(x0) 
n  1  -  F(Xq) 


Since 

,  1  „2  /U~Xo\  ,  _  1 

^  =  b2(n)  \b(n)/  " 

implies  o^/n  =  (1/w)  (l/nb(n) )  -+  0  ,  by  (2.2),  and  thus  Var[h(n,xQ)]  -»  0 
Thus,  h(n,Xp)  is  a  consistent  estimator  of  h(xQ)  ,  and  the  variance  of 
h(n,Xp)  goes  to  zero  at  the  rate  l/nb(n)  .  // 


4 . 1  An  alternate  expression  for  the  bias 

We  shall  find  it  useful  to  express  the  asymptotic  bias  of  h(n,xQ) 

in  terms  of  the  Fourier  transform  <J>,  of  h  .  We  first  note  that  if 

n 


(i  .  W<i 

w(x)  =  <  , 

(O  ,  |x  |  >  1 


then  the  inverse  Fourier  transform  of  w(x)  is 

1  ,  -ixu  ...  1  r  1  -ixu .  sin  x 

2i  1  e  w(u)du  2tt  -1  e  du - 


That  is. 


800  =  f  e  ixU  w(u)du  . 


(4.7) 
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i 


! 


t 

I 

I 

I 


In  view  of  the  above,  the  Fourier  transform  of  S(x)  is  w(x)  ;  that  is 

i  ,  |*|<i 

r  1XU  v,  _  r  J‘  '  " 

/  e  S(u)du  -  J  e 


ixu  -/ ..N,.  r  Sin^u  du  =  ,0  >  |  x  |  >  1  . 


HU 


>2  ,  |x|  =  1 


Recall,  from  (4.1),  that 


du 


b(n) 


I  h(u) 


LH/e 


./xcfu\  1 

^b<n>'V  w(v)dvj  du  , 


by  (4.7) 


//  h(u)  e  U  w(b(n)t)  b(n)dt  du 


-ixnt 

—  /  e  w(b(n)t)  /  e1U  h(u)  du  dt 

2h 


^  f  e  1X°C  w(b(n) t)  «>h(t)  dt 


The  asymptotic  bias  of  h(n,xQ)  is  therefore  given  by 

1  _ix0t 

Bias[h(n,xQ) ]  ~  f  e  w(tb(n))  $^(t)  dt  -  h(xQ) 


1  *x0t  1  *xot 

~  /  e  U  w(tb(n) )  <i>h(t)  dt  -  ■—  /  e  U  <I>h(t)  dt 


-ix  t 

=  ~  /  e  U  (w(tb(n) )  -  1}  4>h(t)  dt 


1  "1*nt 

=  -■!-/  .  e  <J>.  (t)  dt  . 

2tt  1 1 1  > _ i _  h 

'*•  b(n) 


(A. 8) 


5.  Rates  of  Convergence  of  the  Bias  and  the  MSE 

We  are  able  to  investigate  and  optimize  the  rates  of  convergence 
of  the  bias  and  the  MSE  of  h(n,xQ)  when  the  Fourier  transform  of  the 

failure  rate  decreases  exponentially  or  algebraically. 
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Definition  5.1  [Parzen  (1958)1:  A  function  g(x)  is  said  to  decrease 
exponentially  with  degree  0<r<:2  ,  and  coefficient  p>0  ,  if 

|g(x) |  <  Ae  for  some  constant  A>0  ,  (5.1) 


lim  /q  £l  +  exp(2pxr)  jg(xu)|^J  du  =  0  . 


(5.2) 


We  shall  first  need  to  prove  the  following  lemmas. 


Lemma  5.1: 


..  ...  br(n)  f"  -tr 

lim  b(n)  e  1  e 

n^°°  bOO 


dt  =  0  ,  (r  >  0) 


(5.3) 


Proof:  The  right-hand  side  of  the  above  equation,  when  X  =  l/b(n)  ,  is 
°°  .r 

/.  e  dt  -Ar 

lim  - _  =  lim  -  =  lim  — -± -  =  0  .  // 

n-vx.  -xr  A-**  -Xr  .r  -Xr  A^°  rXr  -  1 

Xe  e  -  rX  e 


Lemma  5.2:  If  the  Fourier  transform  of  h  ,  0^  ,  decreases  exponen¬ 

tially  with  degree  r  and  coefficient  p  ,  then,  for  sufficiently 
large  n  , 

/OO  t 

j  Ae  Pt  dt  .  (5.4) 


Proof:  From  (4.8),  we  note  that  for  n  large 


Bias [h( 


"•vi  '  -H,  >a- 


-ix  t 

e  U  $h(t)dt 


However , 
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6 


-hJL  i  e  V^'  5s/t|>_L.  'V01'* 

l*'  b(iO  b(n) 

s  ^  /tl  >_i_  Ae"P,t  ^  dt  ■  ;  /t>_j_Ae'ccr<lt  1 


the  statement  of  the  lemma  now  follows. 


Lemma  5.3:  Suppose  that  the  Fourier  transform  of  h  decreases  ex¬ 

ponentially  with  degree  r  and  coefficient  p  .  Then 

lim  b(n)  e^^  ^  (Bias [h(n,xn) ] |  *  0  ;  (5.5) 


Bias [h( 


v,  (.-1,  v  -p/br(n))  . 

n,xQ)]  =  o\b  (n)  e  / 


X  X 

Proof :  The  result  follows  if  we  make  a  change  of  variable  u  =  pt  , 
and  use  Lemmas  5.1  and  5.2.  Ii 


The  following  theorem  establishes  the  choice  of  b(n)  which 
enables  us  to  obtain  the  optimal  rate  of  convergence  of  the  mean  square 
error  of  h(n,Xg)  ,  when  4*^  decreases  exponentially. 


Theorem  5. A:  Suppose  that  the  Fourier  transform  of  the  unknown 

failure  rate  h  exists  and  decreases  exponentially  with  degree  0<r,<2 
and  coefficient  p>0  .  Then,  if  b(n)  in  the  Fourier  integral  estimat 
or  of  h  ,  h(n,XQ)>  given  by  (3.4),  is  chosen  such  that  b(n)  = 

o(log  n/2p)  t  the  optimal  rate  of  convergence  of  the  MSE  of 

h(n,Xg)  is  of  the  order  log  n/n  . 


*The  notation  "a  =  o(b  )”  denotes  the  fact  that  the  ratio  of 
n  n 

a  to  b  has  limit  0  . 
n  n 
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2  ~ 

Proo f :  From  Lemma  5.3,  we  note  that  Bias  [h(n,Xg)]  decreases  at  least 
2  r 

as  fast  as  1/b  (n)  exp(-2p/b  (n))  .  From  Theorem  4.1,  we  have  the  re¬ 
sult  that  the  variance  of  h(n,Xg)  goes  to  zero  at  the  rate  (nb(n))  * 
The  statement  of  the  theorem  now  follows  from  Davis  (1974).  //. 

We  shall  now  consider  the  class  of  failure  rate  functions  h 
whose  Fourier  transforms  decrease  algebraically. 

Definition  5.2  [Parzen  (1958)]:  A  function  g(x)  is  said  to  decrease 
algebraically  with  degree  p  >  0  ,  if 

lim  |x|P  |g(x) j  =  a2  >  0  ,  (5.6) 

|  x  j  "*°° 

for  some  a  >  0  . 


Lemma  5.5:  Suppose  that  the  Fourier  transform  4>  of  h  decreases  al- 
-  n 


gebraically  with  degree  p  > 1  .  Then 


lia  b1“P(n)  /  K(t)|dt  =  2a\p-l)_1 

n"°°  |t!>b00 


(5.7) 


Proof:  From  (5.6),  we  note  that  for  e>0  ,  there  exists  an  M>0  such 
that 

-  e  <  |t  |p  I^Ct)  |  <  a"5  +  e  , 

whenever  j t  j  >  M  .  Equivalently, 

) t |~p (cx^  —  e)  <  |<t>h(t)|  <  jtfp(a^  +  e)  , 

for  1 1 1  >  M  .  Integrating  both  sides  of  the  above  for  |t|  >  l/b(n)  , 
we  get 

Ji 


2(0*-  e)(p-l)"1  bp_1(n)  <  f  1  1 4>h ( t )  |d 

^ ^  b(n) 


<  2(a!j+e)(p-l)'1  bp-1(n)  ,  jt  |  >  M 
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Note  that  when  n  is  sufficiently  large,  l/b(n)  >  M  ,  and 
|t|  >  l/b(n)  implies  that  |t|  >M  .  Thus,  for  sufficiently  large  n  , 


2(ah-  e) (p-1)-1 


b1  P(n) 


l\(t)  |dt 


2(a  2+  e) (p-1) 


We  therefore  have 


lim  b1-p(n)  /  (t)|dt  =  2oi(p-l)  1 

At|>^  " 


Lemma  5.6:  Suppose  that  the  Fourier  transform  4>,  of  h  decreases  al- 
-  n 

gebraically  with  degree  p  > 1  .  Then  the  bias  of  h(n,x^)  , 

Bias [h(n,XQ) ]  ,  satisfies 

lim  b1  p(n)  | Bias [h (n,x~) ] |  <  a 2  tt  1  (p-1)  1  ;  (5.8) 

n-*» 

thus  the  bias  decreases  at  the  rate  bP  \n)  . 

Proof :  From  (4.8),  we  have 

b1  P(n)  |Bias[h(n,xQ)] |  =  b1  P(n) 

5  bl~P<">H/  ,  ,  • 

,t:|  b(n) 

Using  (5.7),  we  obtain 

lim  b1  P(n)  (Bias  [h(n,xQ)  ]  |  <  —  2a'5(p-l)  1  =  ir  1  (p-1)  1  .  // 

n*°° 


~  f 

2lT  '|t|>A- 


<I>h(t)dt 


The  following  theorem  is  analogous  to  Theorem  5.4. 
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Theorem  5.7:  Suppose  that  the  Fourier  transform  of  the  unknown 

-  n 

failure  rate  h  exists  and  decreases  algebraically  with  degree  p  >  1  . 
Then,  if  b(n)  in  the  Fourier  integral  estimator  of  h  ,  h(n,XQ>  , 

given  by  (3.4),  is  chosen  such  that  b(n)  =  0(n  ^^P  ,  the  optimal 

rate  of  convergence  of  the  MSE  of  h(n,Xg)  is  of  the  order  n^^P  ^  * 


2  ~ 

Proof:  From  Lemma  5.6,  we  note  that  Bias  [h(n,x^)]  decreases  at  the 
rate  of  b  K  (n)  ,  and  from  Theorem  4.1  the  variance  of  h(n,x^)  goes 


to  zero  at  the  rate  (nb(n)) 


-1 


Thus,  a  choice  of  b(n)  =  0(n 


gives  an  optimal  rate  of  convergence  of  MSE  of  the  order  n 


-1 +  1/ (2p-l) 
//. 


6.  A  Comparison  of  the  Rates  of 
Convergence  of  the  MSE's 

We  can  now  compare  the  optimal  rates  of  convergence  of  the  MSE's 
for  estimates  of  h  based  on  kernels  and  the  sine  function  kernel 
which  is  not  an  kernel. 


In  general,  for  L  kernels  which  belong  to  the  class  A 

m 

(i.e.,  an  L^  kernel  K  which  satisfies  the  condition  that 

/x^K(x)dx  =  0  ,  for  j=l,2, . . . ,m-l) ,  and  if  h^m^  exists  (that  is, 
if  h  is  m  times  continuously  differentiable),  then  we  have  shown 
in  SW(1980)  that  the  optimal  rate  of  convergence  of  the  MSE  of  the  ker¬ 
nel  estimator  of  h  is  of  the  order  n  2m/(2m+l)  ^ 


In  Theorem  5.4  of  this  paper,  we  have  shown  that  if  the  Fourier 
transform  of  h  ,  $  ,  decreases  exponentially  with  degree  r  and  co¬ 

efficient  p  ,  then  the  optimal  rate  of  convergence  of  the  MSE  of  the 
Fourier  integral  estimator  of  h  is  of  the  order  log  n/n  .  Since 
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Since  |4>^(t)  )  =  e  C  decreases  exponentially  with  A=1  , 

p=l  ,  and  r  = -2  .  By  Theorem  6.1,  for  this  h  the  Fourier  integral 
estimate  is  better  than  kernel  estimates  based  on  kernels. 

(ii)  Suppose  that  the  failure  rate  function  h  is  of  the  form 

X_  i  _X 

h(t)  =  — -  t2  e  2  , 

2T/zr(Y/2) 

t>0  ,  Y  a  positive  integer.  For  YX2  ,  this  failure  rate  function 
first  increases  and  then  decreases.  The  Fourier  transform  of  h  , 

\  ’  is 

*h(t)  =  (1-  2it)~Y/2  , 

and 

l*h(t)|  =  (l  +  4t2)"Y/4  , 

implying  that  <J>h  decreases  algebraically  with  degree  y/2  .  By 

Theorem  6.2,  for  this  h  the  Fourier  integral  estimate  of  h  is  better 

than  kernel  estimates  based  on  kernels  which  belong  to  A  ,  when- 

m 

ever  Y  >  2(m+l)  . 

(iii)  Suppose  that  h  is  of  the  form 

h(t)  =  Ae  ^'t  ,  t  >  0  ,  A  >  0  ; 

that  is,  the  failure  rate  is  an  exponential  function.  The  Fourier 

transform  of  h  ,  <J>,  ,  is 

h 


*h(t>  =  * 


lyol  = 


(1+iT- 


implying  that  <J>^  decreases  algebraically  with  degree  1.  By  Theorem 
6.1,  for  this  h  the  Fourier  integral  estimate  is  inferior  to  any  ker¬ 


nel  estimator  based  on  an  1.  kernel. 
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7.  Concluding  Remarks 


It  is  evident  from  Theorems  5.4  and  5.7  that  one  would  consider 
using  the  Fourier  integral  estimate  of  h  only  when  one  had  some  prior 
knowledge  about  the  general  behavior  of  h  .  For  example,  if  we  know 
a  priori  that  the  failure  rate  is  an  exponential  function,  then  we  would 

use  an  kernel  rather  than  a  sine  kernel  to  estimate  h  .  In  real 

life,  this  kind  of  prior  knowledge  may  be  hard  to  come  by;  after  all, 
the  practical  problem  is  to  estimate  an  unknown  h  .  This,  therefore, 
poses  a  disadvantage  of  the  sine  kernel  estimators  of  h  . 

Another  disadvantage  of  the  sine  kernel  estimator  stems  from  the 
fact  that  the  sine  function  takes  negative  value  for  a  substantia]  num¬ 
ber  of  points  in  its  domain.  Thus,  the  sine  function  estimator  of  h 
can  be  negative  at  some  points,  a  result  which  is  unacceptable  to  a 
practitioner.  One  may  argue  that  this  is  the  price  that  must  be  paid 
for  obtaining  an  estimator  which  has  good  bias  and  MSE  properties. 

On  the  other  hand,  a  Bayesian  may  view  this  as  another  situation  wherein 
unbiased  estimation  and  MSE  minimization  lead  us  to  unacceptable 
answers.  Efron  (1978),  in  a  delightfully  written  paper,  discusses  some 
controversies  in  the  foundations  of  statistics. 

Our  final  remark  pertains  to  an  interpretation  as  to  why  the  MSE 
of  the  sine  kernel  estimator  of  h  has  a  faster  rate  of  convergence 
than  those  obtained  via  other  kernels.  In  SW(1980),  we  observe  that  an 

indefinite  jackknifing  of  L'*"  kernel  estimators  of  h  is  equivalent  to 
obtaining  a  kernel  estimator  of  h  using  an  alternating  kernel  that  is 

not  L  .  The  sine  kernel,  being  not  and  wavelike,  can  be  viewed 

as  one  that  could  be  the  consequence  of  an  indefinite  amount  of  jack¬ 
knifing.  Since  the  result  of  jackknifing  is  a  reduction  of  the  bias  and 
an  improvement  of  the  rate  of  convergence  of  the  MSE  [Miller  (1978)], 
the  fast  rates  of  convergence  of  the  MSE  follow. 
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To  cope  with  the  expanding  technology,  our  society  must 
he  assured  ot  a  continuing  supply  ol  rigorously  trained 
and  educated  engineers.  The  School  ot  Engineering  and 
Applied  Science  is  completely  committed  to  this  ob¬ 
jective. 


